Abstract

The middle-third Cantor set $C_3$ is a fractal consisting of all the points in $[0, 1]$
which have non-terminating base-3 representations involving only the digits 0 and 2.
We prove that all prime numbers $p > 3$ whose reciprocals belong to $C_3$ must satisfy an
equation of the form $2p + 1 = 3^q$ where $q$ is also prime. Such prime numbers have base-3
representations consisting of a contiguous sequence of 1's and are known as base-3
repunit primes. We also show that the reciprocals of all base-3 repunit primes must
belong to $C_3$. We conjecture that this characterisation is unique to the base-3 case.

1 Introduction

A prime number $p$ is called a base-$N$ repunit prime if it satisfies an equation of the form

$$(N - 1)p + 1 = N^q \tag{1}$$

where $N \in \mathbb{N} - \{1\}$ and where $q$ is also prime. Such primes have the property that



$$p = \sum_{k=1}^{q} N^{q-k} \tag{2}$$



so they can be expressed as a contiguous sequence of 1's in base $N$. For example, $p = 31$
satisfies (1) for $N = 2$ and $q = 5$ and can be expressed as 11111 in base 2. The term repunit
was coined by A. H. Beiler [1] to indicate that numbers like these consist of repeated units.
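The example above can be checked mechanically. The following is a minimal sketch, not part of the original argument; the helper names `to_base`, `is_repunit` and `satisfies_eq1` are hypothetical and introduced only for illustration.

```python
def to_base(n, base):
    """Digits of the positive integer n in the given base, most significant first."""
    digits = []
    while n > 0:
        n, r = divmod(n, base)
        digits.append(r)
    return digits[::-1]


def is_repunit(n, base):
    """True if n is written using only the digit 1 in the given base."""
    return n > 0 and all(d == 1 for d in to_base(n, base))


def satisfies_eq1(p, base, q):
    """Check equation (1): (N - 1) p + 1 = N^q, with N = base."""
    return (base - 1) * p + 1 == base ** q


print(to_base(31, 2), satisfies_eq1(31, 2, 5))  # the example p = 31, N = 2, q = 5
print(to_base(13, 3), satisfies_eq1(13, 3, 3))  # a base-3 instance: p = 13, q = 3
```

Note that these helpers test only the digit pattern and equation (1); they do not test the primality of $p$ or $q$.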

More importantly for what follows, the reciprocal of any such prime is an infinite series
of the form

$$\frac{1}{p} = \frac{N - 1}{N^q - 1} = \sum_{k=1}^{\infty} \frac{N - 1}{N^{qk}} \tag{3}$$

as can easily be verified using the usual methods for finding sums of series. Equation (3)
shows that $\frac{1}{p}$ can be expressed in base $N$ using only zeros and the digit $N - 1$. This single
non-zero digit will appear periodically in the base-$N$ representation of $\frac{1}{p}$ at positions which
are multiples of $q$.
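The periodic digit pattern implied by (3) can be observed directly by long division. This is an illustrative sketch only; `reciprocal_digits` and `nonzero_positions` are hypothetical helper names, and the three test cases are small base-$N$ repunit primes chosen for convenience.

```python
def reciprocal_digits(p, base, count):
    """First `count` digits after the point of 1/p in the given base (long division)."""
    digits = []
    remainder = 1
    for _ in range(count):
        digit, remainder = divmod(remainder * base, p)
        digits.append(digit)
    return digits


def nonzero_positions(digits):
    """1-based positions of the non-zero digits."""
    return [i for i, d in enumerate(digits, start=1) if d != 0]


# (p, N, q) triples satisfying equation (1): 31 = 11111 in base 2,
# 13 = 111 in base 3, 31 = 111 in base 5.
for p, base, q in [(31, 2, 5), (13, 3, 3), (31, 5, 3)]:
    digits = reciprocal_digits(p, base, 3 * q)
    print(p, base, digits, nonzero_positions(digits))
```

In each case the only non-zero digit is $N - 1$ and it sits at positions $q, 2q, 3q, \ldots$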

The case $N = 2$ corresponds to the famous Mersenne primes for which there are numerous
important unsolved problems and a vast literature [2]. They are sequence number A000668 in
The Online Encyclopedia of Integer Sequences [3]. The literature on base-$N$ repunit primes
for $N \geq 3$ is principally concerned with computing and tabulating them for ever larger values
of $N$ and $q$. An example is Dubner's [4] tabulation for $2 \leq N \leq 99$ with large values of $q$.
Relatively little is known about any peculiar mathematical properties that repunit primes
in these other bases may possess.

In this paper we discuss one such property pertaining to base-3 repunit primes, i.e.,
those which satisfy an equation of the form $2p + 1 = 3^q$ with $q$ prime. They are sequence
number A076481 in OEIS. We show that any prime number $p > 3$ whose reciprocal is in the
middle-third Cantor set $C_3$ must satisfy an equation of the form $2p + 1 = 3^q$. Conversely, the
reciprocals of all base-3 repunit primes belong to $C_3$. We conjecture that this characterisation
is peculiar to the case $N = 3$ and discuss this at the end of the paper.

For easy reference in the discussion below, it is convenient to give a name to prime
numbers whose reciprocals belong to $C_3$. A logical one is the following:

Definition 1 (Cantor prime). A Cantor prime is a prime number $p$ such that $\frac{1}{p} \in C_3$.

The following is then a succinct statement of the theorem we wish to prove. 

Theorem 1. A prime number $p$ is a Cantor prime if and only if it satisfies an equation of
the form $2p + 1 = 3^q$ where $q$ is also prime.

2 Proof of Theorem 1

In order to prove Theorem 1 it is necessary to consider the nature of $C_3$ briefly. It is
constructed recursively by first removing the open middle-third interval $(\frac{1}{3}, \frac{2}{3})$ from the closed
unit interval $[0, 1]$. The remaining set is a union of two closed intervals $[0, \frac{1}{3}]$ and $[\frac{2}{3}, 1]$ from
which we then remove the two open middle thirds $(\frac{1}{9}, \frac{2}{9})$ and $(\frac{7}{9}, \frac{8}{9})$. This leaves behind a set
which is a union of four closed intervals from which we now remove the four open middle
thirds, and so on. The set $C_3$ consists of those points in $[0, 1]$ which are never removed when
this process is continued indefinitely.
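The recursive construction can be sketched with exact rational arithmetic. This is an illustration under our own naming (`cantor_stage` is a hypothetical helper), and it enumerates only finitely many stages, so it can show that a point has been removed but cannot by itself prove membership of $C_3$.

```python
from fractions import Fraction


def cantor_stage(n):
    """Closed intervals remaining after n rounds of middle-third removal,
    returned as (left, right) pairs of Fractions in increasing order."""
    intervals = [(Fraction(0), Fraction(1))]
    for _ in range(n):
        next_intervals = []
        for left, right in intervals:
            third = (right - left) / 3
            next_intervals.append((left, left + third))    # keep the left third
            next_intervals.append((right - third, right))  # keep the right third
        intervals = next_intervals
    return intervals


def survives(x, n):
    """True if x lies in one of the closed intervals left after n stages."""
    return any(left <= x <= right for left, right in cantor_stage(n))


print(cantor_stage(2))
print(survives(Fraction(1, 13), 6), survives(Fraction(1, 5), 2))
```

After two stages the sketch returns the four intervals $[0, \frac{1}{9}]$, $[\frac{2}{9}, \frac{1}{3}]$, $[\frac{2}{3}, \frac{7}{9}]$ and $[\frac{8}{9}, 1]$ described above.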

Each $x \in C_3$ can be expressed in ternary form as

$$x = \sum_{k=1}^{\infty} \frac{a_k}{3^k} = 0.a_1 a_2 \ldots \tag{4}$$

where all the $a_k$ are equal to 0 or 2. The construction of $C_3$ amounts to systematically
removing all the points in $[0, 1]$ which cannot be expressed in ternary form with only 0's and
2's, i.e., the removed points all have $a_k = 1$ for one or more $k \in \mathbb{N}$ [5].
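For a reciprocal $\frac{1}{m}$ the digit criterion can be tested in finitely many steps, because the remainders of the long division must eventually repeat. The sketch below assumes $m \geq 2$ is not divisible by 3, so that the ternary expansion is non-terminating and unique; the function name `reciprocal_in_cantor_set` is hypothetical.

```python
def reciprocal_in_cantor_set(m):
    """Decide whether the ternary expansion of 1/m avoids the digit 1.

    Assumes m >= 2 and 3 does not divide m, so the expansion is purely
    periodic and there is no terminating form to worry about.
    """
    if m < 2 or m % 3 == 0:
        raise ValueError("expects an integer m >= 2 that is not divisible by 3")
    remainder = 1
    seen = set()
    while remainder not in seen:   # stop once the digit cycle has closed
        seen.add(remainder)
        digit, remainder = divmod(3 * remainder, m)
        if digit == 1:
            return False
    return True


for m in (4, 5, 7, 11, 13, 37):
    print(m, reciprocal_in_cantor_set(m))
```

The loop runs at most $m$ times, since there are fewer than $m$ possible non-zero remainders.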

The construction of the Cantor set suggests some simple conditions which a prime number
must satisfy in order to be a Cantor prime. If a prime number $p > 3$ is to be a Cantor prime,
the first non-zero digit $a_{k_1}$ in the ternary expansion of $\frac{1}{p}$ must be 2. This means that for
some $k_1 \in \mathbb{N}$, $p$ must satisfy

$$\frac{2}{3^{k_1}} < \frac{1}{p} < \frac{1}{3^{k_1 - 1}} \tag{5}$$

or equivalently

$$3^{k_1} \in (2p, 3p) \tag{6}$$

Prime numbers for which there is no power of 3 in the interval $(2p, 3p)$, e.g., 5, 7, 17, 19,
23, 41, 43, 47, ..., can therefore be excluded immediately from further consideration. If the
next non-zero digit after $a_{k_1}$ is to be another 2 rather than a 1, it must be the case for some
$k_2 \in \mathbb{N}$ that

$$\frac{2}{3^{k_1 + k_2}} < \frac{1}{p} - \frac{2}{3^{k_1}} < \frac{1}{3^{k_1 + k_2 - 1}} \tag{7}$$

or equivalently

$$3^{k_2} \in \left( \frac{2p}{3^{k_1} - 2p}, \; \frac{3p}{3^{k_1} - 2p} \right) \tag{8}$$

Thus, any prime numbers which satisfy (6) but for which there is no power of 3 in the interval
$\left( \frac{2p}{3^{k_1} - 2p}, \frac{3p}{3^{k_1} - 2p} \right)$ can again be excluded, e.g., 37, 113, 331, 337, 353, 991, 997, 1009.
Continuing in this way, the condition for the third non-zero digit to be a 2 is

$$3^{k_3} \in \left( \frac{2p}{3^{k_2}(3^{k_1} - 2p) - 2p}, \; \frac{3p}{3^{k_2}(3^{k_1} - 2p) - 2p} \right) \tag{9}$$

and the condition for the $n$th non-zero digit to be a 2 is

$$3^{k_n} \in \left( \frac{2p}{3^{k_{n-1}}(\cdots(3^{k_2}(3^{k_1} - 2p) - 2p)\cdots) - 2p}, \; \frac{3p}{3^{k_{n-1}}(\cdots(3^{k_2}(3^{k_1} - 2p) - 2p)\cdots) - 2p} \right) \tag{10}$$
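The successive conditions (6), (8), (9) and (10) can be applied mechanically. The following is a rough sketch under our own naming: `power_of_3_in`, `screens_passed` and `is_prime` are hypothetical helpers, and the variable `denominator` stands for the common denominator appearing in (10), taken to be 1 for condition (6).

```python
def is_prime(n):
    """Trial-division primality test, adequate for the small values used here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def power_of_3_in(lo, hi, scale=1):
    """Return k >= 1 with lo < scale * 3**k < hi, or None if no such k exists."""
    k, value = 1, 3 * scale
    while value < hi:
        if value > lo:
            return k
        k, value = k + 1, 3 * value
    return None


def screens_passed(p, max_screens):
    """Number of the conditions (6), (8), (9), ... that p satisfies, up to max_screens."""
    denominator = 1
    for passed in range(max_screens):
        # 3^k in (2p/denominator, 3p/denominator)  <=>  2p < denominator * 3^k < 3p
        k = power_of_3_in(2 * p, 3 * p, scale=denominator)
        if k is None:
            return passed
        denominator = 3 ** k * denominator - 2 * p
    return max_screens


primes = [p for p in range(5, 50) if is_prime(p)]
print([p for p in primes if screens_passed(p, 1) == 0])        # fail condition (6)
print([screens_passed(p, 2) for p in (37, 113, 331, 337, 353, 991, 997, 1009)])
```

The first line of output lists the primes below 50 that fail (6); the second shows that each of the quoted examples passes (6) but fails (8).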

The ternary expansions under consideration are all non-terminating, so at first sight it
seems as if an endless sequence of tests like these would have to be applied to ensure that
$a_k \neq 1$ for any $k \in \mathbb{N}$. However, this is not the case: (6) and (10) capture all the information
that is required. To see this, let $p$ be a Cantor prime and let $3^{k_1}$ be the smallest power of
3 that exceeds $2p$. Since $p$ is a Cantor prime, both (6) and (10) must be satisfied for all $n$.
Multiplying (10) through by $3^{k_1 - k_n}$ we get

$$3^{k_1} \in \left( \frac{3^{k_1 - k_n} \cdot 2p}{3^{k_{n-1}}(\cdots(3^{k_2}(3^{k_1} - 2p) - 2p)\cdots) - 2p}, \; \frac{3^{k_1 - k_n} \cdot 3p}{3^{k_{n-1}}(\cdots(3^{k_2}(3^{k_1} - 2p) - 2p)\cdots) - 2p} \right) \tag{11}$$

Given $3^{k_1} \in (2p, 3p)$, and the fact that (11) must be consistent with this for all values of $n$,
we must have $3^{k_1} - 2p = 1$ in (11). Since there can only be one power of 3 in $(2p, 3p)$, (8)
then implies that $3^{k_2} = 3^{k_1}$, (9) implies $3^{k_3} = 3^{k_1}$ and so on, so we must have $k_1 = k_n$ for all
$n$. Otherwise, if $3^{k_1} - 2p > 1$, it is easily seen that $3^{k_{n-1}}(\cdots(3^{k_2}(3^{k_1} - 2p) - 2p)\cdots)$ goes to
infinity as $n \to \infty$. This is because (8) implies $3^{k_2} \geq \frac{2p}{3^{k_1} - 2p} + 1$, so $3^{k_2}(3^{k_1} - 2p) \geq 3^{k_1}$ with
equality only if $3^{k_1} - 2p = 1$. Therefore with $3^{k_1} - 2p > 1$, (8) implies $3^{k_2}(3^{k_1} - 2p) > 3^{k_1}$,
then (9) implies $3^{k_3}(3^{k_2}(3^{k_1} - 2p) - 2p) > 3^{k_2}(3^{k_1} - 2p)$, and so on. Since the numerators
in (11) are bounded above by $3^{k_1} \cdot 3p$, there must be a value of $n$ for which the interval in
(11) will lie entirely to the left of $(2p, 3p)$, thus producing a contradiction between (6) and
(11). It follows that we cannot have $3^{k_1} - 2p > 1$, so if $p$ is a Cantor prime we must have
$2p + 1 = 3^{k_1}$ as claimed.

We note that the primality of $p$ plays a role in the above in that it prevents $2p$ having
3 as a factor, which would make $3^{k_1} - 2p = 1$ impossible. Since $3^{k_1} - 2p \equiv p \pmod{3}$, we
deduce that only prime numbers of the form $p \equiv 1 \pmod{3}$ can be Cantor primes. Primality
in itself is not necessary, however. The theorem also encompasses non-prime numbers of the
form $x \equiv 1 \pmod{3}$. An example is 4, which satisfies the equation $2x + 1 = 3^y$ with $y = 2$,
and we find $\frac{1}{4} \in C_3$.

Next we use a standard approach to show that $q$ in $2p + 1 = 3^q$ must be prime if $p$ is
prime [6]. To see this, note that if $q = rs$ were composite, with $r, s > 1$, we could obtain an
algebraic factorisation of $3^q - 1$ as

$$3^q - 1 = (3^r)^s - (1)^s = (3^r - 1)(3^{(s-1)r} + 3^{(s-2)r} + \cdots + 1) \tag{12}$$

We would then have

$$p = \frac{3^q - 1}{2} = \frac{3^r - 1}{2} \left( 3^{(s-1)r} + 3^{(s-2)r} + \cdots + 1 \right) \tag{13}$$

Since $2 \mid (3^r - 1)$, this would imply that $p$ is composite which is a contradiction. Therefore $q$
must be prime.
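The factorisation in (12) and (13) is easy to spot-check numerically. This is a small illustrative sketch; `cofactor` and `split` are hypothetical helper names and the pairs $(r, s)$ are arbitrary small choices.

```python
def cofactor(r, s):
    """The sum 3^((s-1)r) + 3^((s-2)r) + ... + 1 appearing in (12) and (13)."""
    return sum(3 ** (j * r) for j in range(s))


def split(r, s):
    """Return the two factors of (3^(rs) - 1) / 2 given by (13)."""
    return (3 ** r - 1) // 2, cofactor(r, s)


for r, s in [(2, 2), (2, 3), (3, 2), (2, 5), (3, 5)]:
    q = r * s
    a, b = split(r, s)
    assert 3 ** q - 1 == (3 ** r - 1) * cofactor(r, s)  # identity (12)
    assert (3 ** q - 1) // 2 == a * b                   # identity (13)
    print(q, (3 ** q - 1) // 2, "=", a, "*", b)
```

For $r, s > 1$ both factors exceed 1, which is what makes $(3^q - 1)/2$ composite whenever $q$ is composite.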

Finally we prove that if $p$ satisfies an equation of the form $2p + 1 = 3^q$ then it must be a
Cantor prime. This can be done by simply putting $N = 3$ in (1) and (3). This shows that
$\frac{1}{p}$ can be expressed in base 3 using only zeros and the digit 2, which will appear periodically
in the base-3 representation at positions which are multiples of $q$. Since only zeros and the
digit 2 appear in the ternary representation of $\frac{1}{p}$, $\frac{1}{p}$ is never removed in the construction of
$C_3$, so $p$ must be a Cantor prime as claimed.
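As a sanity check of this direction, the first few base-3 repunit primes ($q = 3, 7, 13$, giving $p = 13, 1093, 797161$) can be expanded in ternary. The sketch below is illustrative only and `ternary_reciprocal_digits` is a hypothetical helper name.

```python
def ternary_reciprocal_digits(p, count):
    """First `count` ternary digits of 1/p after the point, by long division."""
    digits, remainder = [], 1
    for _ in range(count):
        digit, remainder = divmod(3 * remainder, p)
        digits.append(digit)
    return digits


for q in (3, 7, 13):
    p = (3 ** q - 1) // 2                # so that 2p + 1 = 3^q
    period = [0] * (q - 1) + [2]         # q - 1 zeros followed by a single 2
    digits = ternary_reciprocal_digits(p, 3 * q)
    print(q, p, digits == period * 3)
```

Each expansion consists of the block of $q - 1$ zeros followed by a 2, repeated indefinitely, so the digit 1 never occurs.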

3 Uniqueness of the Base-3 Case 

In the case of the Mersenne primes, corresponding to $N = 2$, the first four stages in the
construction of the middle-half Cantor set $C_2$ involve removing the open middle-half intervals
$(\frac{1}{4}, \frac{3}{4})$, $(\frac{1}{16}, \frac{3}{16})$, $(\frac{1}{64}, \frac{3}{64})$ and $(\frac{1}{256}, \frac{3}{256})$ among others. These contain the first four
Mersenne-prime reciprocals $\frac{1}{3}$, $\frac{1}{7}$, $\frac{1}{31}$ and $\frac{1}{127}$ respectively, so it is clear that Mersenne primes cannot
be characterised in the same way as Cantor primes. Is the characterisation likely to hold for
$N > 3$? We conjecture that the answer is no.
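The four containments just stated can be confirmed with exact fractions. This is a minimal sketch; `removed_interval` is a hypothetical helper returning the removed middle-half interval closest to 0 at each stage.

```python
from fractions import Fraction


def removed_interval(stage):
    """Open interval (1/4^stage, 3/4^stage): the middle half of [0, 1/4^(stage-1)]."""
    return Fraction(1, 4 ** stage), Fraction(3, 4 ** stage)


# Exponents q = 2, 3, 5, 7 give the Mersenne primes 3, 7, 31, 127.
for stage, q in enumerate((2, 3, 5, 7), start=1):
    left, right = removed_interval(stage)
    reciprocal = Fraction(1, 2 ** q - 1)
    print(stage, reciprocal, left < reciprocal < right)
```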

Conjecture 1. It is possible to characterise base-$N$ repunit primes as primes whose
reciprocals belong to $C_N$ (and vice versa) only in the case $N = 3$. This characterisation does not
hold for $N \neq 3$.

Although we cannot provide a formal proof of this conjecture, we offer some counterexamples
for $N > 3$ and some intuitive arguments. It is relatively easy to find counterexamples
showing that there are base-$N$ repunit primes $p$ for $N > 3$ such that $\frac{1}{p}$ does not belong to
$C_N$. One such counterexample for $N = 4$ is the prime number $p = 5$. This satisfies (1) with
$N = 4$ and $q = 2$, so it is a base-4 repunit prime. However, its reciprocal $\frac{1}{5}$ does not belong to
the middle-quarter Cantor set. The first stage in the construction of $C_4$ involves the removal
of the middle-quarter interval $(\frac{3}{8}, \frac{5}{8})$ from $[0, 1]$. The second stage involves the removal of
the middle-quarter interval $(\frac{9}{64}, \frac{15}{64})$ from $[0, \frac{3}{8}]$. The removed interval $(\frac{9}{64}, \frac{15}{64})$ contains $\frac{1}{5}$.
A counterexample for $N = 5$ is the number $p = 31$ which satisfies (1) with $N = 5$ and
$q = 3$. It is therefore a base-5 repunit prime. However, its reciprocal $\frac{1}{31}$ does not belong to
the middle-fifth Cantor set. It can be shown straightforwardly that the fourth stage in the
construction of $C_5$ involves the removal of the open interval $(\frac{16}{625}, \frac{24}{625})$ which contains $\frac{1}{31}$.
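Both counterexamples can be reproduced by following a single point through the construction of $C_N$. The sketch below assumes, as in the examples above, that each stage removes the open middle interval whose length is $\frac{1}{N}$ of the current closed interval; `removal_stage` is a hypothetical helper name.

```python
from fractions import Fraction


def removal_stage(x, n, max_stages=50):
    """Follow x through the middle-1/n construction on [0, 1].

    Returns (stage, (a, b)) where (a, b) is the open interval whose removal
    deletes x, or None if x is still present after max_stages stages.
    """
    left, right = Fraction(0), Fraction(1)
    keep = Fraction(n - 1, 2 * n)          # relative length of each kept piece
    for stage in range(1, max_stages + 1):
        length = right - left
        gap_left = left + length * keep
        gap_right = right - length * keep
        if gap_left < x < gap_right:
            return stage, (gap_left, gap_right)
        if x <= gap_left:                  # x is in the left kept piece
            right = gap_left
        else:                              # x is in the right kept piece
            left = gap_right
    return None


print(removal_stage(Fraction(1, 5), 4))    # base-4 repunit prime p = 5
print(removal_stage(Fraction(1, 31), 5))   # base-5 repunit prime p = 31
print(removal_stage(Fraction(1, 13), 3))   # base-3 repunit prime p = 13
```

A return value of `None` only shows survival up to `max_stages`; it is not a proof of membership.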

Going in the other direction, we can argue intuitively that any primes whose reciprocals
belong to $C_N$ for $N > 3$ are unlikely to be base-$N$ repunit primes. This is because, for $N > 3$,
the middle-$N$th Cantor set $C_N$ is not characterised by the fact that all its elements can be
represented in base $N$ in a way that involves only zeros and one non-zero digit, $N - 1$. Thus,
for an arbitrary prime number $p$ whose reciprocal is contained in $C_N$, it is possible for the
base-$N$ representation of $\frac{1}{p} \in C_N$ to involve non-zero digits other than $N - 1$. Therefore
$p$ will not generally satisfy (1) because (1) implies a base-$N$ representation involving only
zeros and the digit $N - 1$ as shown in (3).